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Cook's [J. Roy. Statist. Soc. Ser. B 48 (1986) 133-169] local influ- 
ence approach based on normal curvature is an important diagnostic 
tool for assessing local influence of minor perturbations to a statistical 
model. However, no rigorous approach has been developed to address 
two fundamental issues: the selection of an appropriate perturbation 
and the development of influence measures for objective functions 
at a point with a nonzero first derivative. The aim of this paper 
is to develop a differential-geometrical framework of a perturbation 
model (called the perturbation manifold) and utilize associated met- 
ric tensor and affine curvatures to resolve these issues. We will show 
that the metric tensor of the perturbation manifold provides impor- 
tant information about selecting an appropriate perturbation of a 
model. Moreover, we will introduce new influence measures that are 
applicable to objective functions at any point. Examples including 
linear regression models and linear mixed models are examined to 
demonstrate the effectiveness of using new influence measures for the 
identification of influential observations. 

1. Introduction. Assessing local influence of perturbing a statistical model 
has been an active area of statistical research in the past twenty years since 
the seminal work of Cook [7]. See, for example, Beckman, Nachtsheim and 
Cook [3], Tsai and Wu [26], St. Laurent and Cook [25], Wu and Luo [31, 32], 
Ouwens, Tan and Berger [21], Pan and Fang [22] and Zhu and Lee [38], 
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among many others. The key idea of the local influence approach is to uti- 
lize the concept of normal curvature in differential geometry (Efron [10] and 
Bates and Watts [2]) in assessing the local behavior of the likelihood dis- 
placement function. Zhu and Lee [37] proposed a generalization of Cook's 
[7] approach based on the Q-function in the EM algorithm to assess local 
influence of a small perturbation to a class of models with incomplete data. 
Zhu, He and Fung [41] developed a local influence method for generalized 
partial linear models for longitudinal data. Lee and Tang [17] examined local 
influence in structural equation models. Zhu and Zhang [40] used measures 
of local influence to assess the extent of discrepancy between a hypothetical 
model and the underlying model from which the data are generated. The 
local influence approach is also useful for sensitivity analysis of missing data 
modeling (Verbeke and Molenberghs [27] and Verbeke et al. [28]). 

The aim of this paper is to construct influence measures in assessing local 
influence of perturbations to a statistical model. Specifically, we address two 
important issues related to the local influence approach: the appropriate 
choice of a perturbation vector and development of influence measures for 
assessing an objective function at any point. 

The first issue, selecting an appropriate perturbation vector, has been 
largely neglected. This issue, however, is central to the development of the 
local influence approach, because arbitrarily perturbing a model may lead 
to inappropriate inference about the cause (e.g., influential observations) of 
a large effect. For instance, when data form clusters (e.g., subjects in lon- 
gitudinal studies and families in genetic studies), then perturbing a cluster 
with more observations likely produces a larger effect. However, to the best 
of our knowledge, no diagnostic methods have ever been developed to take 
into account the differing number of observations in each cluster (see Section 
4 for further discussion). Moreover, because the components of a perturba- 
tion vector may not be orthogonal to each other, special care should be taken 
when we interpret local influence measures from such a perturbation. Thus, 
it is desirable to measure the amount of perturbation, the extent to which 
each component of a perturbation vector contributes to, and the degree of 
orthogonality for the components of a perturbation vector. 

The second issue is the development of influence measures for objec- 
tive functions at a point with a nonzero first derivative. Fung and Kwan 
[11] showed that the normal curvature is not scale invariant and provided 
some examples to illustrate that ambiguous conclusions may be drawn when 
applied to objective functions with a nonzero first derivative. However, 
they did not provide any methodology to address this drawback. Conformal 
normal curvature (Poon and Poon [23]) is invariant under the conformal 
reparametrization at a point with a zero first derivative (called a critical 
point), but it is not scale invariant at a point with a nonzero first deriva- 
tive. These difficulties have limited the application of normal curvature to 
objective functions that have zero first derivative at the critical point. 
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We will introduce a geometrical structure, called the perturbation mani- 
fold, for a perturbation model and use its associated metric tensor and affine 
connection to address the above two issues. The metric tensor of the pertur- 
bation manifold can measure the amount of perturbation and the orthog- 
onality between the different components of a perturbation vector. Thus, 
the properties of metric tensors (e.g., positive definiteness) can be used to 
choose an appropriate perturbation to a statistical model. Furthermore, once 
an appropriate perturbation is chosen, we use the first and second deriva- 
tives of the objective function (e.g., the likelihood displacement function in 
Cook [7]) to construct influence measures. These influence measures can be 
easily applied to any objective function evaluated at any point in order to 
quantify the local influence of minor perturbations to a statistical model. 

2. Perturbation manifold and influence measures. 

2.1. Motivation. Let p(Y\0) be the probability function for an M(n) x 1 
random vector Y T = {Yi ■> ■ ■ ■ i^n)^ parameterized by an unknown param- 
eter vector 6 = (9i, . . . , 9 q ) T in an open subset G of R q . In addition, each 
Yi is an m, x 1 random vector, where Y^i=i m i = M(n). For instance, in 
longitudinal studies, rrii may represent the number of observations in the 
ith cluster. On the basis of the assumed model p(Y\0) and observations 
in y T = (yj , . . . ,yT), we can then carry out statistical inference, such as 
estimation and hypothesis testing. 

Let u> = (cji, . . . , uip) T be a perturbation vector and let u vary in Q C RP. 
If a perturbation vector u, which is introduced to perturb p(Y\0), has a 
large effect, then it is important to know the cause (e.g., influential obser- 
vations or invalid model assumptions) of such a large effect. Therefore, it is 
important to develop statistical methods to quantify the effect of perturbing 
a statistical model and pinpoint the potential cause. 

In Cook [7] a general method was developed to assess the local influence 
of perturbing a statistical model by introducing us into p(Y\0), denoted by 
p(Y\9,u). The proposed methodology is based on the directional curvature 
of an influence graph, which is defined as 

(1) IG(u) = (u T J(u)f, 

where / : RP — > R 1 is a sufficiently smooth (differentiable a certain number 
of times) objective function. Consider the straight line u)(t): u(t) = cj° + th 
in Euclidean space RP and the lifted line IGh (<*>(£)) for any nonzero vector 
h, where u>° is a fixed column vector in RP . The tangent vector and upward- 
point normal vector of the lifted line are, respectively, given by (J!t) and 

(1 + VjV/)- 1 / 2 ^ 7 ), where V/ = (df(u)/du>i) is evaluated at u° and I p 
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is the p x p identity matrix. The normal curvature of the influence graph 
(Cook [7]) is given by 

(2) C h - 1 



(l + VjV/) 1 ^ h T (I p + V/Vj)h' 

where Hf denotes the matrix (d 2 f (to) / dtOi duij) evaluated at a; . The maxi- 
mum value of Ch and the corresponding direction have been widely used to 
assess the effects of using u)(t) = u° + ih to perturb a statistical model. Poon 
and Poon [23] defined a conformal normal curvature at uj° in the direction 
h as 



(3) B h 



1 h T H f h 
Itf/IU/h^Ip + V/Vph ' 



where || • \\m denotes the norm of a matrix such that ||i?/||M = y^ T [Bf] 2 - 
However, Ch is not scale invariant at any u with V/- ^ 0, because the normal 
curvature of IG(o^) = (u T ,kf{u)) T is given by 

r 1 khTH f h , r 

h (1 + k 2 V T f V f f/ 2 h r (I p + fc 2 V f Vj)h T h ' 

which may lead to ambiguous conclusions as k varies (Fung and Kwan [11]). 
The same problem also arises with B^, that is, B^ is not scale invariant. In 
particular, Fung and Kwan [11] argued that the conclusions drawn from the 
new graph IG(cj) should be the same as those from the old graph IG(u>) = 

(" T ,/M) T - 

2.2. Statistical perturbation manifold. We use p(Y\6,u) to denote the 
density function such that J p(Y\0,u:) dY = 1. To assess the local influ- 
ence of a model perturbation, we are primarily interested in the behavior of 
p(Y\6,u>) as a function of u around cj°, not the parameter vector 6. From 
now on, is assumed to be known or be fixed at a given value (e.g., the max- 
imum likelihood estimate) and p(Y\0,uj°) =p(Y\0). Moreover, p(Y\0,uj) 
satisfies the four regularity conditions on page 16 of Amari [1] and a; rep- 
resents no perturbation. 

The perturbed model p{Y\6,u) is characterized by a set of perturbations 
cj, which has a natural geometrical structure (Amari [1]). The perturbed 
model M = {p(Y|#, ld) : u 6 £1} can be regarded as a p-dimensional manifold. 
When a coordinate system u: is given, ej (i = 1, . . . ,p) are the natural basis of 
the tangent space of M associated with the coordinate system (Amari [1] 

and Li and McCullagh [18]). Let T^p be the vector space of M at w, which 
is spanned by p functions dil{u\ Y, 9), where ^(cj|Y,0) = logp(Y|0,u>). A 
natural isomorphism exists between these two tangent vector spaces and 
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TCj . The vector space is called the 1-representation of the tangent space 
of M. For any tangent vector h = X^=i £ T w , the 1-representation h(Y) 

of h in Tu is given by h(Y) = J2i=i h l di£(u\Y, 6), where di = d/duii. 
Definition 1. The inner product of two basis operators di and dj is 



where E w denotes the expectation taken with respect to p(Y\0,u). The p 2 
quantities gij(u>), i,j = l,... ,p, form the metric tensor. 

The metric matrix G{u) = (gij(u)) is an expected Fisher information 
matrix with respect to the perturbation vector u). The elements of G{u) 
measure the amount of perturbation that all components of a perturbation 
vector ijj contribute to a statistical model. The (i,i)th element gu(uj) itself 
indicates the amount of perturbation introduced by the ith component of 
u. The off-diagonal elements of G(u) represent the association between 



different components of u>. For instance, let rij(u) = gij(w) / J gu(u})gjj(u)). 
A large absolute value of rij(u:) indicates strong association between the 
ith and jth components of u. In particular, if G{u) is a diagonal matrix, 
then all components of u are orthogonal to each other in the perturbed 
model (Cox and Reid [9]). Moreover, if G(u) is not positive definite for a 
perturbation scheme, then p operators di are linearly dependent. Thus, this 
indicates that some components of the perturbation vector are redundant 
and these redundant components should be removed; for further discussion, 
see Section 3.3.2. 

Based on the above discussion, an appropriate perturbation to a statistical 
model should satisfy at least two conditions as follows: 

(a) G(uj) is positive definite in a small neighborhood of 

(b) the off-diagonal elements of G(u) at u>° should be as small as possible. 

Condition (a) is required to avoid any redundant components of u. Condi- 
tion (b) is required to ensure that we can easily pinpoint the cause of a large 
effect. For instance, if differing components of u are highly associated, then 
it is difficult to infer whether a large effect is caused by a single component 
of u or by several components of u. Therefore, an appropriate perturbation 
requires that G7(a; ) should be diag(gii(u?°), . . . ,g pp (u} )). Moreover, we can 
always choose a new perturbation vector a), defined by 



(4) 



gijiu) = (d i ,d J ) = E UJ [d l £(u\Y,e)d j £(u\Y,e)} 




(5) 



^ = a ,0 + c -l/2 G( ^0 ) l/2 (| 




such that G(u) evaluated at u>° equals cl p , where c > 0. Therefore, without 
loss of generality, we assume that an appropriate perturbation u satisfies 
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G(oj°) = dp. However, it is not generally possible to find a perturbation 
vector such that G(u) = cl p for all 

We introduce the following geometrical quantities for the perturbed model 
M based on the metric tensor. First, the length ||h|| 2 of a tangent vector 
h G T w is given by 

(6) ||h|| 2 = (h,h) = 5>'tf = h T G(u)h. 

Let C :co(t) = (u;i(i), . . . ,uj p (t)) be a smooth curve on the manifold M con- 
necting two points u^ 1 = u(ti) and u> 2 = wfa). The distance S^w 1 ,u> 2 ) from 
u> 1 to u) 2 along the curve C is given by 

(7) S(u\u 2 )=[ 

The skewness tensor T and a family of affine connections Y a for any a£ R 1 
are, respectively, defined as 

T ijk (u>) = E^HY, 0) <9/H Y, 6) d k £(u>\Y, 0)], 

and rg. fc (w) = E u [a i d j £(w\Y,O)d k e(u\Y,0)] +0.5(1 - a)^- fc (w). It can be 
shown that Tf jk (u) =T° ijk (u) - aT ijk (u) /2, where r?- fc (w) is the Christoffel 
symbol for the Levi-Civita connection of the metric tensor and 

TijfcH = |[%!jfc(w) + djg ik (u>) - d k gij(u})}. 

With the above quantities, the perturbation model M is a statistical man- 
ifold, which plays an important role in understanding the behavior of the 
perturbed model (Amari [1], Kass and Vos [14], Lauritzen [15] and Zhu and 
Wei [39]). 



5^&j(w(t)) 



duii{t) dojj(t) 



dt 



dt 



dt. 



Definition 2. A statistical perturbation manifold (M,G(u),T(u)) is 
the manifold M with a metric G(u>) and a covariant 3-tensor T(u). 



Now we consider a specific smooth curve in M, called an a-geodesic. 



Definition 3. w{t) is called an a-geodesic with respect to the affine 
connection Tfj k (u) if it satisfies the equation 

»> ^ + S 9 "M«))r*Mi))^^ = o, 

s,j,k 

where g ls (uj) is the (z,s)th element of G(u>) -1 . 
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The geodesic is a direct extension of the straight line u>(t) = cj° + th in 
Euclidean space (Amari [1] and Kass and Vos [14]). In particular, as we 
move along a geodesic, the tangent vector of the geodesic does not change 
in length and direction. If Tfj k (u) = for all u, then the manifold is ce-flat 
and the geodesic equation for this a is linear in t: u)(t) =u° + th. 

Some important properties related to the above geometrical quantities 
are summarized in the following lemma, whose proof can be found in Amari 
[1], pages 40, 51-52. 

Lemma 1. Let (ft = ((ft , ...,(ftP) = 4>{u) be a new coordinate system of 
M , d a = d/d<p a , Bf = d(j) a /duJi and B\ = duji/d<ft a . Then the geometrical 
quantities of M in the coordinate system (ft can be written as g a t, = Ylij BaB^gij, 
T abc = E^, k B l a BlB k c T ljk and T« bc = V, ; , /if |- , + V,,/,,,/i;^. 
We use the indices i,j,k, and so on, to denote quantities related to the 
coordinate system lj and the indices a, b, c, and so on, to denote quantities 
related to the coordinate system (ft. 

2.3. Influence measures and their properties. Let f{u)) : BP — ► R 1 be the 
objective function (e.g., the likelihood displacement function in Cook [7] or 
the residual sum of squares in Wu and Luo [32]), which defines the aspect 
of inference of interest for sensitivity analysis. Let uj(t) be a smooth curve 
on M with u(0) = u° and du(t)/dt\ t=Q = h G T w o. Therefore, f(u>(t)) is a 
function of u;(t) defined on the perturbation manifold M. It follows from a 
Taylor series expansion that 

(9) = /(«(0)) + / h (0)t + i/ h (0)t 2 + o(t 2 ). 

The first and second derivatives of f(u(t)) at t = are, respectively, given 
by 

( 10 ) /h(0)=E^\=Vjh and / h (0)=h^ /h + Vj^. 

If V f 0, then the first-order term /h(0) mainly characterizes the local 
influence of a perturbation vector u> to a model. However, if V/ = 0, then it 
follows from (9) and (10) that /h(0) and /h(0) reduce to zero and h T Hfh, 
respectively; therefore, we must use the second order term /h(0) to assess 
the local behavior of the objective function when V f = 0. 

An important question is how to assess the local influence of minor pertur- 
bations to a model when V/ 7^ 0. This is the so-called first-order approach in 
Wu and Luo [31]. For instance, in a transformation model, / can be defined 
as the transformation parameter estimate, whose first derivative does not 
equal at u>° (Lawrance [16]). We introduce a first-order influence measure 
as follows. 
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Definition 4. The first-order influence measure (FI) in the direction 
h £ T^o is defined as 

h T V/Vjh 

( U ) FI /,h ~ FI /(^Q),h ~ h TQ h ' 

where G = G{u°). 

The proposed FI has an interesting geometrical interpretation and is in- 
variant with respect to arbitrary reparametrizations at any point in uj. We 
are now led to the following theorem. 

Theorem 1. We have the following results: 

(i) FI /jh = lim^o [f(u(t)) - f(u(0))] 2 /S(u(0),u(t)) 2 . 

(ii) If (j) is a diffeomorphism of u, then Fly^^h is invariant with respect 
to any reparametrization corresponding to <p and Fl^h = fc 2 FIf h holds for 
any k. 

Proof. It follows from (7) that 5(w(0), u(t)) 2 = t 2 h T Gh + o(t 2 ). Using 
l'Hopital's rule and (9), we can prove (i). Assuming cj = u>(0) and = </>(u>), 
the Jacobian matrices of the above coordinate transformations are given by 
$ = dcf)/du> and ^ = dui/dcf). Differentiating the identities <j>[u)(<j))] = 4> and 
u>[(f>(u>)] = u> with respect to cf> an< 3 w, respectively, leads to = <&^> = I p . 
Thus, we have G(<j>) = fy T G(uj)ty and V^O) = ^ T V /(a ,o), where 0° = u°. 
Using Definition 4, we can prove (ii). □ 



The statistical significance of Theorem 1 is two-fold. First, Theorem l(i) 
indicates that the first-order measure is associated with the first derivative 
of f(u(t)) on M evaluated at t = 0. If M is a Euclidean space, h T h = 1 and 
oj(t) = th + u , then Fl^h reduces to the square of the directional derivative 
of / at u° in the direction h, given by lim^oLf (k' + *h) — f(uj°)] 2 /t 2 . 
Second, although u may not be an appropriate perturbation, we can always 
use G to obtain an appropriate perturbation u in (5), which yields 

h T G , - 1 /2 V/ VjG- 1 /2h 
FI/(ii),h|<i=u>° — h^h ' 

The maximum value of Fl^h equals VjG -1 V f, which quantifies the de- 
gree of local influence of a) to a statistical model, while the corresponding 
direction vector h max = can be used for identifying influential ob- 

servations (Lawrance [16]). 

We use /h(0) to assess the second-order local influence of u to a statisti- 
cal model, even when Vj 7^ 0. The approach which utilizes the information 
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i n /h(0) is called the second-order approach (Wu and Luo [31, 32]). How- 
ever, for a general curve u>(t) on M, /h(0) may not be geometrically well 
behaved (Murray and Rice [20]). Instead, we only consider the 0-geodesic 
uj(t) associated with the Levi-Civita connection of the metric tensor G(u), 
which is unique and defined in an interval containing such that = cj 
and duj(t) jdt = h G T^o . Then, we can obtain a covariant version of Taylor's 
theorem (Murray and Rice [20] and McCullagh and Cox [19]) as follows: 

(12) f(u(t)) = f(u°) + tVjh + lt 2 h T Hjh + o(t 2 ), 

where = H®, ^ and the (i,j)ih element of H^>^ is given by 

s,r 

The matrix H®,^ is called the covariant Hessian of f(oj) (Zhu and Wei [39] 
and Murray and Rice [20]). Because r° fe (w) is symmetric with respect to 
i and j, it can be shown that H®,s is a symmetric matrix. In particular, 
/(u>) sa tisfies the following property (Murray and Rice [20]). 

Lemma 2. Let cf) be a diffeomorphism of u with Jacobian matrix \I/ = 
du/d<t>. ThenH m = ^ T H° m m. 

Lemma 2 shows that is a 2-tensor, so it is geometrically well be- 

haved. 



Definition 5. The second- order influence measure (SI) in the direction 
h G T^o is defined as 

h T H° f h 

(13) SI /jh = SI/ (w o) }h = hTGh ■ 

The standardized SI (SSI) in the direction h 6 T^o is defined as 

1 h T H° f h 

(14) SSI /)h = SSI /(w o ))h = ||G _^o| k h T Gh • 
We can establish the following properties of Sl/^h and SSL^h- 



Theorem 2. We have the following results: 
W SI /{a ,o )ih = hm t _ >0 2[/(^(t))-/(^(0))-tVjh]/5(u;(0),^(t)) 2 . 
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(ii) Suppose (f> is a diffeomorphism of u. Then Slf^cni, and SSIy^o^h 
are invariant with respect to any reparametrization corresponding to at 
u° . Moreover, 

(15) SIkf(w),h = kSIf( w -) >h and SSI^( w ) )h = SSIf( w ) )h , 

for any kj^O and u) £ f2 . 

(iii) Let {(Aj,Uj),i= l,...,p} be the eigenvalue-eigenvector (E-E) pairs 
of with respect to G. Then, for any direction h, we have < SSI/^ < 1, 

SI/, Uj = \ an d SSI/.Ui = Aj : 



where Aj zs £/ie normalized eigenvalue. 

Proof. Using (7), (12) and l'Hopital's rule, we can prove Theorem 2(i). 
Because a diffeomorphism exists between uj and <f> such that u = u>(0) and 
= <p(ui), we have ^"1? = 3>\l/ = l p . Moreover, because G(cf)) is a metric 
tensor and is a 2-tensor, we have 

G(<t>) = H> T G(u)y and #° (0O) = * T flJ (w0) *. 

Consider a geodesic with u;(0) = and du(0)/dt = h G T w o. Then 
4>(u(t)) is a geodesic in the ^-coordinate such that 4>(uj°) = <p° and dd>(u(0))/dt 
$h. If G is positive definite, then it follows from Lemmas 1 and 2 that 

Oi /(0°),*h h T $ T G (^0 )$h h r $ T tf T G(a; )*$h °V(«°).h- 

Similarly, we can show SSIy^O) $h = SSIj^o^h- Thus, SI/^ and SSI^h are 
invariant with respect to reparametrization <fi at u°. For any k, we have 
Hkf(w) = ^-^/(w) an< ^ equation (15) holds for any cj and k^=0. This proves 
Theorem 2(ii). By using Definition 5, we thus prove Theorem 2 (iii). This 
completes the proof of Theorem 2. □ 

Theorem 2 has the following implications. First, if u) is an appropriate per- 
turbation and V/ = 0, then SSI/ ; h = -E>h and SI/ 5 h = Ch- We note that most 
of the examples in Cook [7] fall into this scenario. In general, even though we 
may choose a perturbation ld which is not appropriate, we can always use G 
to obtain an appropriate perturbation a) in (5). In this case, the normal cur- 
vature and the second-order influence measures will lead to the same results 
when V/ = and the chosen perturbation is appropriate. Therefore, the 
diagnostic method proposed here can be regarded as an extension of Cook's 
[7] local influence approach in a more general setting. Second, SIf( w ) h and 
SSIyv w ) h are scale invariant even when V/ 7^ 0, whereas Ch and Bh are 
not (Fung and Kwan [11]). This generalization facilitates new methods and 
techniques for doing sensitivity analyses of a statistical model. 
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2.4. New local influence approach. What follows are the four key steps 
in assessing local influence of perturbing a parametric model p(Y\0): 

Step 1. Choose a perturbation scheme u such that J p(Y\6,u) dY = 1. 

Step 2. Given the perturbed model, calculate the geometrical quantities 
[e.g., gij(i*)), Tijk(u}), and Tf- k {u)] of the perturbation manifold. 

Step 3. Check whether the perturbation u is appropriate, that is, G(u>°) = 
dp. If yes, go to Step 4 below. Otherwise, find a new perturbation scheme 
and go back to Step 2. 

Step 4. Choose an objective function f(uj). If V/ = 0, then use SI and 
SSI to assess local influence of minor perturbations to a model. However, if 
is nonzero, then use FI, SI and SSI together. 

3. Appropriate perturbations in four examples. We examine four exam- 
ples to illustrate how to calculate geometrical quantities for a perturbation 
manifold and show how to find an appropriate perturbation in the exam- 
ples. We also consider several objective functions to assess the local influence 
of an appropriate perturbation to a parametric model in each of the four 
examples. 

3.1. Case-weight perturbation. Suppose that Yi, . . . , Y n are independent 
with mi = ••• = m n = 1 and p(y\0) can be written as 117=1 p{Uu 0) ■ We 
consider case- weight perturbation, in which L(6\u) is given by 

n 

(16) L(o\u) = J2"Ayi;9), 

1=1 

where £(yf,6) = logp(in; 0). Thus, p = n and w = 1„ is an n x 1 vector with 
all elements equal to 1. The density of the perturbation model p(y\9,u) is 
given by 

n 

(17) p(y\0,u>) = ]J{eM^(yi;9)}[cM;0)r 1 }, 

i=l 

where Ci(uJi]6) = J exp{uii£(yi;0)} dyi for all i = 1, . . . ,n. After some alge- 
braic calculations, we have the following results. 

Theorem 3. If the four regularity conditions of Amari [1] hold for 
p(y\9,u), then the following results hold for case-weight perturbation: 

(i) di\ogCi(u)i]6) =E UJ [£(y i ;0)] and d e log Ci{uji\0) =uj i E u ,[d0£(y i ;O)) ; where 
the expectation E u is taken with respect to p(y\0,u); 

(h) dfiogc i (u i -e) = v&T UJ [e(y i ;e)}, 

djlogaiui-0) =u H E w [d$£(y i ;G)] +«?Var w [^(iK;0)] 
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and 

dedi\ogCi(oJi;9) 

= E UJ [d e e{y i ;e)] 

+ UiEUideKyf, 0) - E4d e £(yi;0)]}{£(yi;0) - E^fa; 0)]}); 

(iii) (/ y (w) = Vaxu[£( yi ; 9)}S ij , Tf jk (u) = 0.5(1 - a)T ijk (u) and 

T ljk {u>) = E w [{£(y 4 ; 6) - E^fo; 9)}} 3 ]S ij S ik /or i,j,k = l,...,n, 

where Sy is the Kronecker delta; 

(iv) i/ie geodesic u = u(t) with respect to Yf- k {uj) satisfies 

|Jexp|| /(Or^(^)^}^ = M 

/or i = 1, . . . , n, where cj(0) = u>° and du>(t)/dt = h = (h\, . . . , h n ) T . In par- 
ticular, u>(t) = uj° + th is a 1-geodesic. 

PROOF. The log-likelihood function of p{y\9,u) is given by 

n 

e(u\y, 9) = logp(yto) = $>i^; 0) - logc^; 0)], 

i=i 

where rj = (0,w). By using E w [^(u;|y, 9)] = and E w [-S^(w|y,0)] = 
E a ;{[5, ? £(ci;|y,0)][^£(a;|y,0)] T }, we can obtain (i) and (ii). 

By differentiating £(u\y,9) with respect to u, we have di£(u\y,9) = 
£{yi\ 0) — dilogCi(u>i; 0) and <9?£(cj|y, 0) = — <9? logCj(cjj; 0). Therefore, we 
can directly calculate the geometric quantities gij(u:), Tf- k {u) and Tij k (to), 
which lead to (iii). 

The geodesic u = cj(i) with respect to T"- k {u) satisfies a second-order dif- 
ferential equation which is defined by d 2 Ui{t) / dt 2 + g 11 (^T^^u^dio^t) / dt] 2 = 
0, with initial conditions u)(0) = cj° and du{t)/dt = h. We can prove (iv) by 
solving this second-order differential equation (Coddington [6]). 

Theorem 3 establishes the manifold of case- weight perturbation in (16). If 
the Yi are also identically distributed, then G(u°) = gn(u>°)I n and the per- 
turbation in (16) is an appropriate one. Moreover, if we treat both responses 
and covariates as random variables, the perturbation in (16) is appropriate 
even for the regression case. In general, if 

G = diagtVar^o [£(yi;9)}, . . . , Var w o [£(y n ;9)}) + cl n 

for any c > 0, then we consider a new perturbation u> in (5) with c = 1 such 
that G(u) = l n at u° and 

n 

(18) L(0\u>) = El 1 + v/Var^o^;©)]^ - l)]£( yi ; 9). 

i=i 
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With the development above, we can now choose an objective function (e.g., 
the likelihood displacement) and calculate its associated influence measures 
{FI, SI, SSI} to assess local influence of the perturbation (16). For instance, 
if we are interested in a particular component of 6, say, 0±, we may use 9\{u) 
as an objective function, where 6\{u) is the maximum likelihood estimate 
of 6\ under the perturbation (18). □ 

3.2. Location- scale family. Suppose that Y±, . . . ,Y n are independent and 
mi = ■ .. = m n = 1. Let 0= ((3 T ,cr 2 ). Each p(yf, 6) = a~ 1 p (a^ 1 (yi- fi(xf f3))) 
belongs to a location-scale family such that po is a known density satisfying 
/ xpo(x) = and f x 2 po(x) = 1, where fi(-) is a given function and Xj is a 
qi x 1 vector. Thus, E(yi) = = /j,(xff3) and Var(yj) = a 2 . 

We consider three different perturbations: case-weight perturbation, per- 
turbation of the variance and perturbation of the response. Then we establish 
a perturbation manifold for each of these perturbations as follows. 

For case- weight perturbation, since £(yi\ 6) = logpo(c _1 (y« — Mi)) ~~ log ") 
we can use the transformation a = {yi — Hi)/o to obtain gij{u) = 
Var aJ: o[logPo(ej)]%- Similarly, we can calculate Y"- k {u) = (1 — a)Tijk(u)) and 

Tijk{u) = E w ,o({logpo(ei) - E u)fi [logpo(e i )]} 2 )5i j 5kj. 

For the perturbation of variance, we consider a heterogeneous variance of 
y = (yi, . . . , y n ) T such that Var(y) = a 2 diag(l/u 2 , . . . , 1/w 2 ). In this case, 
u° = l n and p = n. The log-likelihood function for the perturbed model 
is given by -ralog cr + J2f=i^oguji + Ya=i logpo(*i), where U = Wjcr" 1 ^ - 
/i(xf/3)]. After some algebraic manipulations, we get gij(^) = 
5ijUJ~ 2 E {[xd x logp (x) + l] 2 }, T ijk (u) = 5 ij 5 ik uj^E {[xd x logp (x) + l] 3 }, 
and 

F« k (u) = -5^5^ (E {[xd x logp (x) + l] 2 } 

+ 0.5aE {[xd x logp (x) + l} 3 }) 1 

where d x = d/dx and the expectation Eo is taken with respect to po(x). 

For the perturbation of response, we consider adding a perturbation uji 
to yi such that £(u\Y,0) is given by — nlogcr + Ya=i l°gPo( cr ~ 1 [yj + ~ 
/i(xf/3)]). Let ti = a~ l [yi + uji — /u(xf j3)]. With some calculations, we can ob- 
tain gij{u») = 5ija~ 2 Eo{[d x logp (x)} 2 }, T ijk (u) = 5ij5 ik a~ 3 Eo{[xd x logp (x) + 
l] 3 } and Tfj k (u) = —0.5aTij k (u). Thus, since T^ k (u) vanishes for all i, j, k = 
1, . . . , n, the straight line u(t) = u° + th is a 0-geodesic. 

Combining the above results, we have the following theorem. 

Theorem 4. // all p{yi]6) = a^ 1 po(o-~ 1 (y i - /x(xf/3))) belong to the 
same location-scale family, then 

G(uj°)=cl n and C h = cSl LD ^o) h 
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hold for case-weight perturbation, the perturbation of variance and the per- 
turbation of response, where LD(u) denotes the likelihood displacement func- 
tion in Cook [7J. 

Theorem 4 indicates that the three perturbation schemes considered here 
are appropriate perturbations. Therefore, for LD{u) introduced in Cook 
[7], both {SI, SSI} and the normal curvature lead to the same results for 
location-scale families under the three commonly used perturbations. 

3.3. Linear regression model. Consider the linear regression model 
(19) Y = X(3 + e, 

where Y is an n x 1 vector of responses, X is an n x q\ covariate matrix, (3 
is a qi x 1 vector of unknown parameters and e = (ei, . . . , e n ) T is an nxl 
random vector of errors with distribution iV[0,o" 2 I n ]. Here = (j3,a 2 ). In 
this section we fix at its maximum likelihood estimate. 



3.3.1. Perturbation of error variances. We consider a perturbation to 
error variances via an n x 1 perturbation vector u such that 

(20) Var(e) = a 2 diag(l/w ls . . . , l/w n ) 

and oj° = l n . It has been shown in Theorem 4 that gij{<^) = 0.5co^ 2 5ij, 
Tjjfc(cj) = —uj^5ij8jk and Yf- k {u) = 0.5(1 — a)Tijk(u)). Thus, the perturba- 
tion (20) is an appropriate one. However, for illustrative purposes, assume 
that we consider another perturbation scheme = (0i, . . . , 4> n ) T such that 

u 

(21) Var(e 1 ) = ( r 2 - -5—-, Var(e 2 ) = a 2 ^\ . . . , Var(e n ) = a 2 ^ 1 , 

k - 1 + 0i 

where ko > 0. Therefore, it can be shown that 0° = u° and G(0°) = diag(l/2fe 2 , 
1/2, . . . , 1/2). Thus, G(0°) = cl n if and only if ko = 1. That is, the pertur- 
bation vector is appropriate only when ko = 1 in (21), which reduces to 
(20). 

The perturbation of error variances is applied to the residual sum of 
squares, that is, /(<*>) = —RSS(uj) = — r(u) T r(u), where r(u>) is the 
residual vector under the perturbation (20). It can be shown that at 
a; = l n , V-rss = (~rl -r 2 n ) T and H. RSS = 2D(r)P x D(r), where P x = 
(Pij) = X(X T X)~ 1 X T and D(r) = diag(n, . . . , r n ), in which ri,i = 1, . . . ,n, 
are ordinary residuals when cj° = l n . Because G = 0.5I n , Fl^^h = 
2(h T V_R55) 2 /h r h for any h. In particular, we can obtain the maximum 
value of Fl.^s^h as Ya=i r t an d the corresponding direction vector is given 

by h max = ^-rss/ \Jj2j=i r j> which is the same as Lawrance's [16] diag- 
nostic. Subsequently, we use Sl-^s^h to assess the local influence of the 
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perturbation (20) to model (19). For a = 0, it can be shown that H^_ RSS = 
2D(r)PxD(r) — D(r 2 ), where D(r 2 ) = diag(r 2 , . . . , r 2 ). Thus, we have 
Sl-Rss,Ei = ^rl(2pu - 1.0)<7~ 2 m -2rfa~ 2 for i = l,...,n, where Ej is an 
n x 1 vector with ith element one and zero otherwise for i = 1, . . . ,n. There- 
fore, if an observation has a large absolute residual, then it will be identified 
as influential by using h max and SI-rss^ ■ 

3.3.2. Perturbation of the explanatory vector. Consider the perturbation 

(22) X U = X + WS, 

where W = {yjik) is an n x qi matrix of perturbations, S = diag(si, ...,s qi ) 
with Si 7^ for each i and s k converts lo^ to the appropriate size and unit so 
that SfcWjfc is compatible with the (i,fc)th element of X. With some calcu- 
lations, it can be shown that G(u) = Sf30 r S ® I n /er 2 , where ® is the Kro- 
necker product of two matrices. Because d£(u\Y , 6) / du>ik for k = 1, . . . ,q± 
are linearly dependent, the metric matrix is only positive semidefinite. In 
this case, the singularity of G{u) indicates that we have introduced too 
many perturbation parameters; and therefore, some perturbation parame- 
ters should be removed. One possibility is to transform u> € BP into u* € R r 
(p > r) and make the metric tensor positive definite in the new coordinate 
system. 

Another perturbation scheme is 

(23) X UJ = X + W*1*S, 

where W* = diag(u>i, . .. ,co n ) and 1* is an n x gi matrix with all elements 
equal to 1. After some calculations, we can obtain gij(u)) = <5m(X}?Li s kPk) 2 /°~ 2 
and Tijk(u>) = Tf- k (u) = for all i,j,k = l,...,n. The corresponding per- 
turbation manifold is a-flat for any a £ R 1 (Amari [1]). Moreover, the com- 
monly considered line oj(t) = cj° + th is a geodesic with respect to Yf- k {u) 
for any a € R 1 in this perturbation manifold. In particular, the perturba- 
tion (23) is an appropriate one, because the metric matrix G(u) = cl n and 
is independent of u>. 

3.4. Testing a parametric family. Suppose that Y%, . . . ,Y n are indepen- 
dent and identically distributed and the density of Yi, denoted by Po(yi]0), 
belongs to a certain parametric family, such as a Gaussian distribution. We 
consider a log-linear expansion perturbation of po(yi', 6) as follows (Claeskens 
and Hjort [5]). Let {ipj(yi]0) :j = 1, ... , m} be a set of functions such that 
they are orthogonal with respect to po(yi;0) and also orthogonal to ipo(yi]0) = 
1. That is, / ipj(yi;O)ijj k (yi;0)p o (yi;6) dyi = S jk E (ipl) for all j, k = 0,...,m 
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and i = 1, . . . , n, where Eo denotes the expectation with respect to Po(yi',0). 
For each i, the log-linear expansion perturbation is defined by 

(24) p(y;;6>,u;) = p Q { yi] e)c{u)~ l expj^^Vj^^; )j, 

where c(u) = J Po(yi]0)exp{Y?JLi u; j'ipj(y , i',9)}dyi. Thus, the log-likelihood 
function for the perturbed model is given by 

n m 

£(u\Y, 0) = Y, logp (yuO) + n ^ u^(0) - n0(w), 
i=i j=i 

where tpj(9) = J27=i 4>j(yu6)/ n an d <A(<^) = logc(u;). In this case, cj° = m , 
anmxl vector with all elements 0, and p = m. 

After some algebraic derivations, we can obtain the geometrical quan- 
tities of the perturbation (24) as follows. Because dj£(w\Y,0) =n r ipj(0) — 
ndj(j)(u)), we have gjk(<^) = ndjdk4>(u), Tj^u) = ndj di<p{u) and 
I^ w (w) = 0.5(l-a)T,-fci(c«;) (Amari [1]). In particular, G(w°) = ndiag(E (^f ) 
. . . , Eo(V'm))- ^ can be shown that the perturbation in (24) is an appropriate 
one if and only if Eo(V'J) are homogenous, that is, Eo^i) = • • • = Eo(V'm)- 
Even though Eo(V'J) are not homogenous, we can always choose a new per- 
turbation G) = G^uj ) 1 / 2 ^ such that 



(25) p(y. i; 0,cj) =po(y l ;0)c(^)- 1 exp|^cD i ^(y i ;0)/ v /E o (V'|)|. 

This u is an appropriate perturbation with G(tj)|o m = I m - 

We consider the log-likelihood ratio /(£>) =£(£j[Y,G) — \ogp(Y\0) as our 
objective function. Direct calculation leads to 

Vf = Vn[ , and FI /)h = L, . 

V^Eo^f) v/EotO/ hTh 

The maximum value of FI /)h is VjV/ = «Ef=i W(0) 2 /EoW$), which is the 
well-known score test statistic for testing Hq : cj = when is either known 
or replaced by its estimate (Claeskens and Hjort [5]). Moreover, for each 
Ej, FI^Ej = nipj(9) 2 /Eo(tpj) is the score test statistic for testing Hq :ujj = 0, 
where Ej is an m x 1 vector with ith element equal to one and zero otherwise 
for j = 1, . . . , m. We can use FIj Ej to detect the most influential perturba- 
tion from all m perturbations. Under some conditions (e.g., m grows slowly 
with n), the score statistic VjV/ converges weakly to a nondegenerate ran- 
dom variable (Claeskens and Hjort [5]), which can be used to characterize 
the asymptotic behavior of the influence measures FIjh- 

Combining the above results, we have the following theorem. 
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Theorem 5. If Po(y,9) belongs to a certain parametric family, then 
the perturbation u in (25) is an appropriate perturbation. In particular, the 
maximum value o/FI^h is the score test statistic for testing the hypothesis 
H :uj = 0. 

4. Application to linear mixed models. We consider data that are com- 
posed of a response yij and a q± x 1 covariate vector Xjj for j = 1, . . . , rrii 
within clusters i = 1, . . . , n. We define the linear mixed models as 

(26) y i = x i (3 + e i , 

where y« = (ya, . . . ,yi mi ) T is an m,xl vector, xj = [xa, . . . ,x im J, (3 is a 
qi x 1 vector of unknown parameters and is normally distributed with 
mean zero and covariance matrix Sj = Xj(£), in which £ is a g2 x 1 vector. 
Thus, T = (/3 T , £ T ) is a g x 1 vector, where q = qi + q2- 

For the linear mixed model, because the estimates of 9 (e.g., maximum 
likelihood estimates) may heavily depend on a small portion of the data 
or even one observation (or cluster), it is important to detect both influen- 
tial clusters and influential individual observations. However, at either the 
subject or individual level, we cannot distinguish between influence due to 
the specific cluster characteristics and influence due to the characteristics of 
specific observations within a cluster. For further discussion on these issues, 
see Ouwens, Tan and Berger [21], Zhu and Lee [37, 38], Zhu, He and Fung 
[41] and Fung, Zhu, Wei and He [12], among many others. 

The likelihood displacement function (Cook [7]) will be used throughout 
this section. We define L{9) =logp(Y|0) and L(0\u) = logp(Y|#, u). Let 
9 and 9^ be the maximum likelihood estimates of L{9) and L(9\u), respec- 
tively; the likelihood displacement function is given by LD(u) = 2[L{9) — 
L(0 a; )]. It can be shown that Hld = 2A T (— L) _1 A, where A is a q x p 
matrix with elements d 2 L{9\u)/d9idiOj and L is a q x q Hessian matrix 
with elements Lij = d 2 L{9) / dOidOj evaluated at 9 and u°. We calculate 
the Hessian matrix — L as 

x 7 1 vjr 1 X' 

where <9g£j(£) = <9vec(Xj(£))/<9£ is a q2 x mj matrix, in which we define 

vec(Z) — (-Zii , . . . , z\ mi , . . . , z mi \ 1 . . . , z mirrii ) 

for any m,j x rrii matrix Z = (zij). We calculate the geometrical quantities 
of the perturbation manifold, the A = d 2 L / '89 du matrix and the influence 
measures below. 
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4.1. Perturbation of individual covariance matrix. We consider the per- 
turbation of the individual covariance matrix by assuming that 

(27) Cov(y,;) = oj^^-'Ei for all i = 1, . . . , n. 

Thus, u) — l n and p — n. For the perturbed model, both Z/(0|cj) and 
i(u)\Y,6) equal 



n 

i log | Si | + \^rii log toi - YY^Ui{yi - x i /3) T S J ~ 1 (y i -Xj/3). 



After some calculations, we have gij(u) = 0.5miUJ i 2 o~ij, Tijk{u>) = —rriiUJ i 3 x 
Sijdik and r?. fc (w) = 0.5(1 - a)T ljk {u). At u°, G = diag(0.5mi, . . . ,0.5m n ) 
indicates that the amount of perturbation introduced by Ui is proportional 
to mi, the number of observations in the ith cluster. Thus, u is an appro- 
priate perturbation if and only if mi = ■ ■ ■ = m n . Although m; may not be 
homogeneous, we can always consider an appropriate perturbation u in (5) 
such that G(u))\ w a = I n and 

(28) Cov(y i ) = [l + (cD i -l)/y05^]S i for i = 1, . . . ,n. 

For the appropriate perturbation (28), we get 

d 2 L(6\Cb) _ / ___ 1/2 y 1 n , „T A1 T\T 



A, = = y05< 1 / 2 (2ef £ri Xi , [^(Er 1 ® ST" 1 ) vec^ef )f) 

where = yi — Xj/3. For simplicity, let /3 be the parameter of interest. It 
can be shown (Cook [7]) that 

C E% = Sl LDtEl = 2mr 1 e(Z- 1/2 P ii Z- 1/2 e i , 

where P u = Sr^x^E^ixjE^j^xfs: 1 / 2 . 

4.2. Perturbation of responses. 

4.2.1. Scheme one. We consider the perturbation 
(29) yi(u)=yi+uJil mi . 

Thus, a; = n and p = n, where n represents annxl vector with all 
elements equal to 0. For the perturbed model, both L(6\u>) and £(u\Y,0) 
equal 

n n 

-|^log|Ej| - l^2(yi + Uil mi -itif3) T '£7 1 (y i +Uil mi -Xj/3). 

i=l i=l 

After some calculations, we have gij{^) = l m .Sj rl \ m fiij , ?ijfc(cj) = and 
T ijk( u ) =0 - In this case > G =di a g( 1 mi^l ll mi,---,lm n T 'n ll m n ) and the 
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ith. diagonal element of G also depends on the number of observations in the 
ith cluster. This perturbation manifold is a-flat for any a £ R l and U)(t) = th 
is a geodesic with respect to Tfj k for any a. However, u is an appropriate 

perturbation if and only if l^E^j - l mi = ••• = l^E" 1 ].^. Therefore, u 
may be not an appropriate perturbation, but we can always consider an 
appropriate perturbation a; in (5) such that G{Cb)\ OJ o = I n and 



n. 



(30) =y l+ u H l mi /^lT.Xr 1 l m . for i = 1 

For the perturbation u in (30), we have 

A, = (l^Efixi, [^(E" 1 <g> S- 1 ) vec(l mi ef )] T ) T . 
Let £ be the parameter of interest. It can be shown that 

Ce, = 4(1^. Er 1 ]^.) -1 

xvec(l mi ef) T (Sr 1 / 2 
where Qa is defined as 



I, 1/2 )Qn^i i/ ')vec(l r 



-1/2 



-1/2, 



-1/2 



Lj=l 



-1/2 



S" 1/2 ). 



4.2.2. Scheme two. We consider the mean shift perturbation model 
(31) y*(u;) =yi + 

where = (m,. . . , w im J T . Thus, u 0T = (0^ , . . . , O^J and p = £™ =1 
For the perturbed model, both and £(u\Y,6) equal 



- \ log | S, | - \ ]T (y* + Wi - x,/3) T E - 1 ( yi + co, - Xi/3) . 

i=l i=l 

After some calculations, we have gij(u)) = E" 1 ^-, Tjjfc(cj) = and r? fe (u;) = 
0, where i, j and k vary from 1 to n. The structure of the metric tensor 
G(u) = (gij(u)) indicates that the perturbations u>j in different clusters are 
orthogonal to each other, whereas the components {uju : Z = 1, ...,mj} of 
cjj are associated with each other. This perturbation manifold is also a- 
flat for any a € R 1 and u)(t) = th is a geodesic with respect to Ffj k (u) for 
any a. However, u is not an appropriate perturbation, because G(u)\ w o 
does not have the form &m, where M = Ya=i m i- Therefore, we consider an 
appropriate perturbation u in (5) with c= 1 such that G(u))\ u o = I n and 



(32) 



for i = 1, 



, 71. 
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For the perturbation u> in (32), we have A, = [Er 1 ^, (ef E" 1 ® Er^Ef ] T . 
Let £ be the parameter of interest. It can be shown that (Cook [7]) 

R = 4E}' 2 (ef ^ s -V2 )Qii(s -i/ 2ei E -i/2 )E i/2 

is the submatrix of iiz,,D corresponding to the ith perturbation vector a)j. 
Therefore, Ce^ , = SI^e,;, which corresponds to the Ith diagonal element 
of Rj, where Ej^ is a p x 1 vector with a 1 at the (X^=i m fc + Oth element 
and otherwise. 

4.3. Yale infant growth data. The Yale infant growth data were collected 
to study whether cocaine exposure during pregnancy may lead to the mal- 
treatment of infants after birth, such as physical and sexual abuse. A total 
of 298 children were recruited from two subject groups (cocaine exposure 
group and unexposed group). The key feature of this dataset is that differ- 
ent children had different numbers and patterns of visits during the study 
period. We refer to Wasserman and Leventhal [29] and Stier et al. [24] for a 
detailed description of the study design and data collection. Recently, Zhang 
[34, 35] developed multivariate adaptive splines for the analysis of longitudi- 
nal data (MASAL) to analyze the Yale infant growth data. The importance 
of our reanalysis here is to develop a local influence approach for the MASAL 
model. 

For the Yale infant growth data, Zhang [35] selected the MASAL model 

Vij = 4" 

where x^ = (l,d, (d-120) + ,(d- 200)+ ,(g a - 28)+ ,d(g a - 28) + ,(d-60) + {g a - 
28)+, {d-A90) + (g a -28)+, sd, s(d- 120)+ ) T , in which d and g a are the age of 
visit and gestational age, respectively, and s is the indicator for gender, with 
one indicating a girl and zero indicating a boy. In addition, we assume that 
= (eii) • • • }£im,i) T ~ N[Q, Ej(£)] and Ej(£) is determined by the variance 
and autocorrelation parameters, which are, respectively, given as 

V(d) =exp(£ + 6d + £2d 2 + £3d 3 ) and p(0 + 

where I is the lag between two visits. For simplicity, we assume that all knots 
are given so that the MASAL model reduces to the linear mixed model (26). 
The total number of data points is Y^7=i m i = 3176 and the total number of 
clusters is n = 298. The estimated parameters are 

/3 T = (0.744, 0.029, -0.0092, -0.0059, 0.204, 

0.0005, -0.0007, -0.0009, -0.0026, 0.0022) 

and 

£ = (-0.53,0.0064,-1.9 x 10" 5 ,2.1 x 10" 8 , 0.929, -0.0013) T 

(Zhang [35]). 
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We calculated the local influence measures for the three perturbations 
discussed in Sections 4.1 and 4.2 and present the main findings of the local 
influence approach in Figures 1-3. 

For the perturbation of the individual covariance matrix, the quantity SIe 4 
for perturbation (28) reveals four influential subjects {141,246,269,285} 
(Figure 1), whereas subjects {141,246} do not stand out as influential using 
normal curvature for perturbation (27). Because m\w = 4 and m^m = 5 are 
much smaller than the average number of observations 3176/298 = 10.6, a 
relatively large normal curvature from subjects {141, 246} represents a large 
effect. A closer inspection of the data (not presented here) shows that the 



(a) (b) 




Subject Index Normal curvature for peterbation (27) 



Fig. 1. Yale infant growth data: (a) index plot of 3176 x SlEi/298 for the appropriate 
perturbation (28); (b) Ce ; for perturbation (27) against 3176 x SlEi/298 for the appro- 
priate perturbation (28). 




Fig. 2. Yale infant growth dataset: (a) metric tensor ga(u ) for perturbation (29) and 
the number of observations for each subject; (b) Ce; for perturbation (29) and 1.9 x SIe ; 
for the appropriate perturbation (30). 
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raw and fitted curves for these four subjects differ substantially, especially 
at the last observation. 

The metric tensor ga(u°) for perturbation (29) is positively correlated 
with the number of observations in each subject [Figure 2(a)]. Furthermore, 
because the variability of ga(u°) is relatively small, normal curvatures under 
perturbation (29) are close to second-order influence measures [Figure 2(b)]. 
Both C El and SI El reveal four influential subjects {111,116,246,274} [Fig- 
ure 2(b)]. Furthermore, for perturbation (32), SIe, suggest that (3, 7), (24, 7), 
(24,8), (227,11), (290,12), (290,13) and (109,12) are seven influential ob- 
servations, where for each i denotes the subject number and I denotes 
the observation number [Figure 3(a)-(b)]. 

5. Conclusion. We have introduced a local influence method to assess 
minor perturbations to a statistical model. Our method extends the previ- 
ous local influence method (Cook [7]) in several aspects. First, we propose 
to use the metric tensor of a perturbation manifold to select an appropri- 
ate perturbation to a model. The major advantage of using an appropriate 
perturbation is that it leads to a nice interpretation of the effect of all el- 
ements of a perturbation vector on a statistical model. We have shown in 
Sections 3.1-3.3 that most of the perturbation schemes considered in Cook's 
[7] examples are appropriate. However, we have also shown in several exam- 
ples, such as linear mixed models and testing parametric families, that some 
commonly used perturbations may not yield an appropriate perturbation; 
see Sections 3.4 and 4. Second, we have developed influence measures with 
nice geometrical interpretations for smooth objective functions at any point. 
The influence measures proposed here avoid the previous drawback that the 




02 Oi 06 OB I 12 

Normal curvature for perturbation (31 ) 



227 



109 




300 
days 



Fig. 3. Yale infant growth data: (a) rescaled Ce; t for perturbation (31) and SIe; l for 
the appropriate perturbation (32); (b) connected curves of SIe; , of five influential subjects 
{3,24,109,227,290} for the appropriate perturbation (32). 
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normal curvature is not well defined for some objective functions at points 
with a nonzero first derivative. In addition, the proposed second-order influ- 
ence measures reduce to normal curvatures for the likelihood displacement 
function (Cook [7]). Third, we have established a connection between the 
score test statistic and the FI measures; see Section 3.4. Finally, we have 
also examined a number of models to highlight the importance of choosing 
an appropriate perturbation and the broad spectrum of applications of this 
local influence method. 

Many issues still merit further research. One major issue is calculation of 
the influence measures and metric tensor under different situations, such as 
measurement error models (Carroll, Ruppert and Stefanski [4]; Zhong, Wei 
and Fung [36]), generalized linear models with missing data (Ibrahim, Chen, 
Lipsitz and Herring [13]), partially linear models (Zhu, He and Fung [41]) 
and structural equation models (Yuan and Bentler [33]). Another major is- 
sue is to establish relationships between the influence measures and other 
influence diagnostics, such as case-deletion measures and leverage (Cook and 
Weisberg [8] and Wei, Hu and Fung [30]). It is also important to develop 
appropriate influence diagnostics for detecting influential clusters in longitu- 
dinal data by taking into account the number of observations in each cluster 
and models used to fit the longitudinal data. However, the influence diag- 
nostics calculated in PROC MIXED of SAS 9.1 (SAS Institute Inc., Cary, 
NC) do not take into account the number of observations in each cluster; 
therefore, these influence diagnostics may give misleading results. We expect 
that the metric tensor of the perturbation manifold will play a critical role 
in this new development. 
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